#preference datasets
07/07/2025
SynPref-40M and Skywork-Reward-V2: Revolutionizing Human-AI Alignment with Scalable Reward Models
SynPref-40M is a new preference dataset of roughly 40 million preference pairs. It enables the Skywork-Reward-V2 family of reward models to achieve state-of-the-art results across multiple human-AI alignment benchmarks.